ABSTRACT

One of the most challenging issues in evolutionary biology concerns speciation, the emergence
of new species from an initial one. The huge number of species found in nature
demands a simple and robust mechanism. Yet no consensus has been reached concerning a
reasonable disruptive selection mechanism that prevents gene mixing among the emerging
species, especially when they live in sympatry. It is usually assumed that females select
males according to their displaying traits, but that males perform no selection on female traits.
However, experimental evidence for the existence of male choice is accumulating.
Here we propose a robust mechanism for sympatric speciation, based on the assumption that
sexual selection operates in two directions: selection of males by females and of females by
males. Complex mating instabilities emerge, creating differential fitness that depends on the
individuals' displaying traits and preferences. When a secondary sexual trait is introduced
in a population, due to mutations, to the activation of previously neutral genes or to a
different perception of already existing displaying traits, sympatric speciation may result
(together with a species recognition system) from a competitive exclusion principle. We
suggest that yeasts could be potential candidates to test our theory.

TEXT

Traditionally, two main classes of models try to explain speciation. Allopatric speciation
models are the most widely accepted and assume that the initial population is suddenly divided
into two geographically isolated subpopulations, which then diverge until they become
reproductively isolated, even after a secondary contact (Mayr 1963). There is, however, mounting
evidence (Albertson et al. 1999; Panhuis et al. 2001; Schliewen et al. 1994) that speciation
might have emerged in sympatry (without geographical isolation). Examples like the cichlids
in Lake Victoria or many migratory birds do not seem to fit the basic requirement of long
periods of geographical isolation needed for allopatric speciation. Laboratory experiments
have also shown that, in principle, sympatric speciation is possible (Rice & Hostert 1993).
Understanding how sympatric speciation can be driven has thus attracted much theoretical
effort. However, some early works showed that finding a biologically reasonable and robust
model is not an easy task (Felsenstein 1981; Maynard Smith 1966).

Recently, several models have been proposed to explain sympatric speciation. Nevertheless,
no consensus has been reached concerning the main driving mechanism or the
conditions required in practice. One class of models concentrates on sexual competition for
mates. So far, these have been the least convincing models (Turelli et al. 2001), as
they have been incapable of providing a robust mechanism that is valid under reasonable initial
conditions (Takimoto et al. 2000; Turelli et al. 2001). The other class of models assumes
that competition for resources is essential to create disruptive selection (Doebeli & Dieckmann
2000; Kondrashov & Kondrashov 1999; see a discussion in van Doorn & Weissing).
However, this class also requires a non-trivial association between traits relevant for ecological
adaptation and traits used for sexual discrimination. Further, and in particular in the Doebeli
and Dieckmann theory, phenotypes using scarce resources may replace phenotypes using
more abundant resources, as a result of strong competition for resources in the latter case.
This is a somewhat counter-intuitive result. Again, it remains uncertain which species should
be more prone to speciation in practice.

Here we propose an alternative approach that leads to a robust theory for sympatric
speciation. We give special emphasis to pair formation. Pair formation is an essential
issue in any sexual population, as it is at the basis of the species' reproductive success. As
we will show, modelling pair formation is essential to create linkage disequilibrium due to
the emergence of complex mating instabilities.

In our model we assume that every individual possesses a preference list, in which all individuals
of the opposite sex are ranked in descending order of priority. The goal of any individual
is to optimise its satisfaction by mating with convenient partners. How the preference list is
established depends on a response to a sensory stimulus induced by the varied secondary sexual
traits displayed by individuals of the opposite sex. In principle, it could also have an ecological
component, to model, for instance, the preference for certain habitats (Bush 1969).

As the number of displaying traits can be quite large and they may be perceived differently by
different individuals, a simple assumption is to consider that each individual has its own
preference list and that this list is approximately random. In this way we assume that sexual
selection has already eliminated non-favoured traits, so that only traits that are neutral with
respect to sexual selection remain. Note that sexual selection does not necessarily mean the
selection of traits relevant for adaptation. Sexual selection may only improve the fitness of a
population by optimising the consequences of the mating conflicts created.

Sexual conflicts lead to intricate dynamics, as both males and females pursue selfish goals,
with conflicting wishes. Consider the following preference lists, for a population with three
males and three females.

If all individuals search continuously for better partners, then we can imagine the following
algorithm. First, males propose sequentially to females ranking higher in their preference
lists, and females accept the new partners only if they improve their satisfaction. Then
females and males switch roles. In this case, both males and females adopt active strategies
towards mating (they both search for better mates). We could sketch that M1 mates with F1, M2
mates with F2, but then F2 divorces M2 and mates with M3 because he is higher in her preference
list, and so on. It is easy to conclude that no stable arrangement is reached. That is, when
both sexes are active, the dynamics does not settle into a Nash equilibrium (Gale & Shapley
1962; Omero et al. 1997). While these
instabilities are not the rule (for instance, a couple can be stable if they prefer each other),
they produce differential costs for reproduction, as fitness depends on the stability of the
couples. There is an algorithm, due to Gale and Shapley (Gale & Shapley 1962; Omero et
al. 1997), that leads to stable arrangements. This happens if males propose to females, and
females dispose, agreeing to mate with new suitors only if they improve their satisfaction.
Nevertheless, even if a stable solution exists, the time needed to reach it, $t_{GS}$, tends to
be quite large, due to the large number of degrees of freedom and the complex dynamics.
In practice, the results we will present do not depend on the strategy adopted by females
towards mating.
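
The Gale and Shapley procedure referred to above can be sketched in a few lines of Python. This is only an illustration: the three-by-three preference lists are hypothetical (they are not the lists of the original example), and the function names `gale_shapley` and `is_stable` are ours. Males propose and females dispose.

```python
def gale_shapley(male_prefs, female_prefs):
    """Deferred acceptance: males propose, each female keeps her best suitor so far."""
    # rank[f][m] = position of male m in female f's list (lower is better)
    rank = {f: {m: i for i, m in enumerate(prefs)}
            for f, prefs in female_prefs.items()}
    next_choice = {m: 0 for m in male_prefs}  # next female each male will propose to
    partner_of = {}                           # female -> current male
    free_males = list(male_prefs)
    while free_males:
        m = free_males.pop()
        f = male_prefs[m][next_choice[m]]
        next_choice[m] += 1
        current = partner_of.get(f)
        if current is None:
            partner_of[f] = m                 # f was single and accepts
        elif rank[f][m] < rank[f][current]:
            partner_of[f] = m                 # f improves and divorces `current`
            free_males.append(current)
        else:
            free_males.append(m)              # f refuses, m stays free
    return {m: f for f, m in partner_of.items()}


def is_stable(matching, male_prefs, female_prefs):
    """True if no male and female prefer each other to their current partners."""
    husband = {f: m for m, f in matching.items()}
    for m, prefs in male_prefs.items():
        for f in prefs:
            if f == matching[m]:
                break  # the remaining females rank below his partner
            if female_prefs[f].index(m) < female_prefs[f].index(husband[f]):
                return False
    return True


# Hypothetical preference lists, most preferred partner first.
male_prefs = {"M1": ["F1", "F2", "F3"],
              "M2": ["F2", "F3", "F1"],
              "M3": ["F3", "F1", "F2"]}
female_prefs = {"F1": ["M2", "M3", "M1"],
                "F2": ["M3", "M1", "M2"],
                "F3": ["M1", "M2", "M3"]}
matching = gale_shapley(male_prefs, female_prefs)
```

With these hypothetical lists every male obtains his first choice while every female ends up with her last choice, which illustrates how conflicting the wishes of the two sexes can be even in a stable arrangement.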



Methods 

We consider discrete generations of a population with an equal number, $N_{tot}$, of males and
females. The genetic information of each individual controls several characters. A sexual
locus may control the strategy towards mating if one wants to study the selection of
these strategies but, as we checked, this does not change the results concerning speciation.
Henceforth males and females are assumed to take active and passive strategies towards
mating, respectively.

One locus with two alleles (we assume dominance) accounts for a secondary sexual trait
(A or B). Several independent loci with two alleles (+ or -) define a quantitative preference
for one trait. Each individual ranks the individuals of the opposite sex in a preference
list. Preference lists are defined with a simple step-like structure. Calling $n_+$ the fraction of
positive alleles, a positively (negatively) biased arrangement corresponds to a preference
for mates with trait A (B), such that there is a fraction $f_\nu(n_+) = [4 n_+ (1 - n_+)]^{\nu}/2$ of
non-preferred mates ranked within the first $N_A$ ($N_B$) positions ($N_A$ and $N_B$ being the numbers
of individuals with phenotype A and B, respectively). The positive exponent $\nu$ controls
how efficiently individuals with a biased arrangement of alleles discriminate mates with
preferred traits. For small values of $\nu$, deviations from an unbiased arrangement produce
poor discrimination, as can be seen in figure 1.
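
As an illustration, the step-like construction can be sketched in Python. This is a sketch under stated assumptions, not the code used for the results: the function names are ours, the order within each block of the list is taken to be random, the number of non-preferred mates in the first block is rounded to the nearest integer, and an unbiased individual ($n_+ = 1/2$) is arbitrarily treated as preferring A.

```python
import random


def f_nu(n_plus, nu):
    """Fraction of non-preferred mates in the first positions: [4 n+ (1 - n+)]^nu / 2."""
    return (4.0 * n_plus * (1.0 - n_plus)) ** nu / 2.0


def preference_list(mates, n_plus, nu, rng):
    """Step-like preference list of one individual.

    mates: list of (identifier, trait) pairs, with trait "A" or "B".
    n_plus: fraction of + alleles in the preference loci of the individual.
    """
    preferred_trait = "A" if n_plus >= 0.5 else "B"   # tie at 1/2 is our assumption
    preferred = [m for m in mates if m[1] == preferred_trait]
    others = [m for m in mates if m[1] != preferred_trait]
    rng.shuffle(preferred)
    rng.shuffle(others)
    head_size = len(preferred)                        # N_A (or N_B)
    n_intruders = min(len(others), round(f_nu(n_plus, nu) * head_size))
    # First block: mostly preferred mates plus a fraction f_nu of non-preferred ones.
    head = preferred[:head_size - n_intruders] + others[:n_intruders]
    tail = preferred[head_size - n_intruders:] + others[n_intruders:]
    rng.shuffle(head)
    rng.shuffle(tail)
    return head + tail


# Usage: 10 mates of each trait, 30 preference alleles of which 27 are positive.
rng = random.Random(0)
mates = [(i, "A") for i in range(10)] + [(i, "B") for i in range(10, 20)]
ranking = preference_list(mates, n_plus=27 / 30, nu=1.0, rng=rng)
```

For $n_+ = 1/2$ the function gives $f_\nu = 1/2$ for any $\nu$, so the first block is an even mixture of both traits, whereas for $n_+ = 0$ or $1$ it contains preferred mates only.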

On each iteration, randomly picked individuals (with an active strategy towards mating)
are given one opportunity to find a better partner. New matings happen only if they improve
the satisfaction of both members of the couple. The process repeats until $2N_{tot}$ offspring are
born. One offspring of each sex is born if a couple stays together during a courtship time $t_c$
plus a reproduction time $t_r$ (results concerning the emergence of speciation do not change
if offspring are only allowed to be born after a stabilization time). During the reproduction time,
females cannot engage in new matings, but males may find a better partner (in relation to
speciation, the relevant parameter is simply $t_c + t_r$). Offspring genotypes are found using
Mendel's genetic rules.
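
The rule that a new mating happens only if both members improve their satisfaction can be sketched as follows. This is a simplified illustration with hypothetical names (`try_new_mating`, `partner`, `rank`); the courtship and reproduction times, and the restriction on females during reproduction, are deliberately left out.

```python
def try_new_mating(male, female, partner, rank):
    """Give `male` one opportunity to mate with `female`.

    partner: dict mapping each individual to its current partner (or None).
    rank[x][y]: position of y in the preference list of x (lower is better).
    Returns True if the new couple forms.
    """
    def improves(x, candidate):
        current = partner.get(x)
        return current is None or rank[x][candidate] < rank[x][current]

    if partner.get(male) == female:
        return False                      # already a couple
    if not (improves(male, female) and improves(female, male)):
        return False                      # at least one of them would lose
    # Both gain: their previous partners, if any, become single.
    for x in (male, female):
        ex = partner.get(x)
        if ex is not None:
            partner[ex] = None
    partner[male], partner[female] = female, male
    return True
```

In a full simulation, a randomly picked active individual would call such a function once per iteration, and a couple would reproduce only if it survives for $t_c + t_r$ iterations.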

By introducing a probability $\mu$ of changing each allele controlling the trait of the offspring,
we can model phenomenologically the dependence of the trait on many genes or the effect of
mutations. A similar procedure can be applied to the loci controlling the preference.
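
A minimal Python sketch of this inheritance step for the trait locus is given below. The function names and the representation of a genotype as a pair of alleles are our assumptions; allele A is taken as dominant, and "changing" an allele is taken to mean switching it to the other allele.

```python
import random


def offspring_locus(mother, father, mu, rng, alleles=("A", "B")):
    """Genotype of the offspring at one diploid locus.

    Each parent transmits one of its two alleles at random (Mendel's rule);
    each transmitted allele is then switched to the other allele with probability mu.
    """
    child = [rng.choice(mother), rng.choice(father)]
    for k in range(2):
        if rng.random() < mu:
            child[k] = alleles[1] if child[k] == alleles[0] else alleles[0]
    return tuple(child)


def phenotype(genotype, dominant="A", recessive="B"):
    """Displayed trait, assuming the allele `dominant` masks the other one."""
    return dominant if dominant in genotype else recessive


# Usage: a heterozygous mother and a homozygous recessive father.
rng = random.Random(0)
child = offspring_locus(("A", "B"), ("B", "B"), mu=6e-4, rng=rng)
trait = phenotype(child)
```

The same function could be applied independently to each preference locus, with the alleles + and - in place of A and B.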

Finally, ornamentation costs can be introduced within our framework by assigning a birth 
probability, related to the parent's trait. We checked that speciation is not prevented if the 
ornamentation costs are not too different, and thus this point will not be discussed further 
here. 

Results and Discussion 

Consider first a population with only one trait. In figure 2, the distribution function of
mating lifetimes is shown. It exhibits a clear power law decay. This scale-invariant behaviour
means that there is a large number of stable matings, but also a large number of very
unstable, short-lived matings, relative to what one would expect if matings were random.
Thus pair formation will certainly play a role in a sexual selection theory, as it distinguishes
individual fitnesses over several scales. As far as speciation is concerned, another important
consequence is that the stability of a couple is context dependent: the same couple may have very
different lifetimes depending on the preferences present in the whole population.

Note that scale-invariant laws are often associated with self-organized criticality (Jensen
1998). Here, however, the scale invariance results from the existence of preference lists. It is
nevertheless conceivable that alternative rules exist that lead to a similar scale-invariant
behaviour in a self-organized way.

For speciation to emerge as a result of sexual selection, a new secondary sexual trait
must be introduced in a population (due to mutations, or to the activation of certain genes
(Rutherford & Lindquist 1998) induced by new ecological conditions), or already existing
displaying traits must be perceived differently under new ecological conditions (for instance,
water quality or light intensity in a lake (Boughman 2001)), differentiating the individuals.
Thus, a population subject to new ecological conditions may be driven into a sexual selection
process, of which, as we will show, sympatric speciation can be the outcome.

Consider the limit case in which an individual ranks all the mates with the preferred
trait first. In general, if a new trait (A or B) is introduced in a population, some individuals
may prefer a mate with trait A, while others prefer one with trait B. We can classify males and
females in four groups, AA, AB, BA and BB, the first letter standing for the trait and the
second for the preference. If only males adopt active strategies towards mating, then both
AA and BA males compete for the same females, AA or AB. A potential opportunity cost
exists if, for instance, a BA male mates with an AA female, as there is a high probability
that this mating will not last long enough to accomplish reproduction. We have checked
that the mating instabilities strongly reduce the lifetime of couples in which traits and preferences
do not match. From the females' point of view, AA and BA females prefer AA or AB
males, which produces an indirect competition between AA and AB males. Neglecting the
effects of recombination and assuming that the average number of males $ij$, $m_{ij}$, is equal
to the average number of females with matched preferences, $f_{ji}$, then $m_{ij}$ grows roughly
with the probability of forming stable couples, $dm_{ij} \sim \alpha\, m_{ij}^{2} / m_{\cdot j}$, where $m_{\cdot j}$ is the total
number of males with preference for $j$. Sexual selection will only keep those individuals
reproducing at the highest rate, leading to a competitive exclusion principle with symmetry
breaking, in which only AA×AA and BB×BB couples, or BA×AB and AB×BA couples, are
selected. Introducing recombination with one dominant locus for each of trait and preference,
this competitive scenario can be tackled analytically, as shown in the appendix. Then
only AA×AA and BB×BB couples are selected, which corresponds to the emergence of
homotypic preferences.
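
The exclusion argument without recombination can be illustrated with a toy numerical iteration of the growth law $dm_{ij} \sim \alpha\, m_{ij}^{2}/m_{\cdot j}$, here in Python. This is not the individual-based model of the Methods: the explicit time step `dt`, the renormalisation to a fixed total frequency, the choice $\alpha = 1$ and the initial frequencies are all assumptions made for the sketch.

```python
def step(m, dt=0.1):
    """One update of the male frequencies m[(trait, preference)].

    Each group grows as m_ij^2 / m_.j, where m_.j is the total frequency of
    males sharing the preference j; the result is renormalised to sum to 1.
    """
    totals = {}
    for (trait, pref), x in m.items():
        totals[pref] = totals.get(pref, 0.0) + x
    grown = {}
    for (trait, pref), x in m.items():
        rate = x * x / totals[pref] if totals[pref] > 0 else 0.0
        grown[(trait, pref)] = x + dt * rate
    norm = sum(grown.values())
    return {key: x / norm for key, x in grown.items()}


# Hypothetical initial frequencies with a slight excess of AA and BB males.
m = {("A", "A"): 0.30, ("B", "A"): 0.20, ("A", "B"): 0.20, ("B", "B"): 0.30}
for _ in range(2000):
    m = step(m)
```

Starting from a slight excess of AA and BB males, the BA and AB groups die out and AA and BB coexist. By the symmetry of the growth law, an initial excess of BA and AB males would instead select the BA×AB and AB×BA couples, which is the symmetry breaking mentioned above.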

Our general model considers a quantitative genetic preference, determined by several
additive independent loci. The number of these loci does not need to be only one, as considered
in the simple case of the appendix. If we consider a large number of loci controlling
the preferences, then random configurations produce only small preference tendencies. The
more biased the arrangement of preference alleles is, the larger will be the number of individuals
with the preferred trait ranked in the first positions. As discussed above, the function
$f_\nu(n_+)$ establishes the relation between the bias in the preference loci and the intensity of
discrimination. In figure 3a we show how sympatric speciation emerges, starting from a population
with random genotypes. Even for species with a considerable number of independent genes
controlling the preferences, speciation is achieved remarkably quickly, within just a few generations.
Due to trait dominance, only AA or BB individuals remain in the end, all heterozygotes being
eliminated. This corresponds to the emergence of two new species and a species recognition
system. Prezygotic isolation is thus achieved, giving way to species divergence. In this example,
trait discrimination is the same for males and females (same $\nu$). However, in this
example speciation could also be achieved if we had chosen higher trait discrimination for
females (higher $\nu$: $\nu_{females} = 1.5$ and $\nu_{males} = 0.5$), as is likely to happen in nature. In
fact, trait discrimination needs to operate in both sexes, but its intensity may vary.

If $f_\nu(n_+)$ departs slowly from the unbiased arrangement of alleles, corresponding to
difficult discrimination, disruptive sexual selection can be prevented (figure 3b). It is also
possible to prevent speciation if traits are not straightforwardly inherited, due to complex dependencies
on a large number of other genes (a trait may result from intricate interactions of
a large number of proteins, leading to a complex inheritance) or as a result of mutations.
These effects are taken into account through the parameter $\mu$. Large values of $\mu$ (for the
example of figure 3a, $\mu \approx 6 \times 10^{-3}$ mutations per generation and per allele) reduce the effects
of the non-random pair formation created by the complex dynamics. Consequently, above a
certain threshold value speciation is prevented, and a population with genotype frequencies
in Hardy-Weinberg proportions remains.

Our theory assumes that, in a population, each individual presents many phenotypic
traits, each contributing in a complicated way to the mate choice of the other individuals.
Many of these traits may not have a strong impact on mate choice, so that a
phenotypic polymorphism remains in the population (they would be approximately neutral as
far as sexual selection is concerned). In figure 4a we show a population in which discrimination
with respect to a given trait is poor during the first 200 generations. The population consists
of individuals with both traits and weak preferences for either trait. There is
no relevant correlation between traits and preferences in the individuals. At generation
200, discrimination for the trait is increased and a quick speciation event emerges. An
alternative picture is presented in figure 4b. Starting from a monomorphic population with
sufficiently strong trait discrimination, mutations and a release of natural selection pressure
(as a result of a change of habitat, for instance) allow genetic drift. Preferences acquire
variability, until individuals with preferences for both traits exist in the population. Then
sympatric speciation emerges. In practice, this latter scenario needs more generations to reach
the disruptive configuration that leads to speciation.

The theory presented here explains, under minimal assumptions, how sympatric speciation
emerges or not, together with a species recognition system. It is a robust mechanism,
not requiring special assumptions on the initial species (Doebeli & Dieckmann 2000; Kondrashov
& Kondrashov 1999; Lande 1982; Turner & Burrows 1995; van Doorn & Weissing),
on male-female incompatibilities (Gavrilets 2000; Parker & Partridge 1998), on the ecological
conditions (Turner & Burrows 1995), or on non-obvious connections between sexual
and ecological traits (Doebeli & Dieckmann 2000; Felsenstein 1981; Maynard Smith 1966;
Turelli et al. 2001; van Doorn & Weissing). Our theory assumes that males and females
have preferences over the same traits. While female preferences have been extensively studied
(Andersson 1994; Futuyma 1998), male preferences are usually neglected. Nevertheless,
there is now mounting experimental evidence for the existence of male choice (Andersson
1994; Burley et al. 1982; Rutowski 1982; Roulin et al. 2000; Katvala & Kaitala 2001;
Amundsen & Forsgren 2001). This is actually a natural outcome of natural selection, as
males would otherwise incur potential opportunity costs: those having the ability to recognise
healthier females, or predator mimicry, should certainly be favoured. However, in practice,
male preferences should be more difficult to detect, as males usually adopt active strategies towards
mating, and thus are seen courting many females. This is certainly an issue
deserving more research.

A unified understanding of sexual selection, from simple organisms like yeasts to
complex organisms like mammals, may be attempted with our model. Indeed, in yeasts it
is now clear that both mates in a couple choose their partners (Jackson & Hartwell 1990).
Further, many genes can be involved in the several complex steps of this process (White
& Rose 2001). Hence, analogues of the preference lists used in our model may already be
built in. For this reason we propose yeasts as potential candidates to test our theory in
the laboratory. If, as predicted, robust sympatric speciation is observed, then the
implications would not be only conceptual. Indeed, we believe that sympatric
speciation could be used as a general framework for studying the interactions between receptors
and pheromones, and for building a better understanding of how these correlate
with the intracellular state. This issue will be addressed in a forthcoming publication.

Appendix: Mathematical approach to the sexual competitive scenario for a
population with two loci, for traits and preferences, with a dominant allele
each.

Here we outline a mathematical approach to the emergence of an exclusion principle
from our results. The dynamical correlations in the model strongly reduce the fitness of
non-optimal couples. Assume then that fitness is proportional to the probability of forming
couples in which traits and preferences match. In a two-locus, two-allele model, we have 9
genotypes, denoted by $ij$, with $i$ and $j$ associated with displaying traits and preferences,
respectively, and $i, j = 1, 2, 3$. Genotype 2 is heterozygous, and 1 is the dominant homozygote. Denote
the number of males (females) with genotype $ij$ by $m_{ij}$ ($f_{ij}$). The evolution of the number
of genotypes $ij$ is proportional to $dm_{ij} \sim \sum_{klmn} \alpha_{ij}^{klmn}\, m_{kl} f_{mn} / f_M$, where $f_M$ is the number
of females with the trait corresponding to genotype $mn$ ($M = A, B$) and $\alpha_{ij}^{klmn}$ is a matrix
giving the reproduction frequencies following Mendel's rules. This equation is quite general
but simplifies considerably in the present case, as most entries in the matrix are zero (see
below). For instance, the equation associated with phenotype BB (genotype 33) is:

$dm_{33} \sim m_{33} f_{33} / f_B + m_{23} f_{32} / (4 f_B) + m_{32} f_{23} / (4 f_A) + m_{22} f_{22} / (16 f_A)$.

An exclusion principle emerges as genotypes try to grow as fast as possible (following
similar growth equations) while interacting, given that we concentrate on populations with a
fixed number of individuals. Only those growing at the highest rate will remain.

If we sum over all genotypes contributing to the same phenotype, we arrive at similar equations
for the evolution of the phenotypes. It is possible to check that all terms contributing to
$dm_{AB} = dm_{13} + dm_{23}$ and $dm_{BA} = dm_{31} + dm_{32}$ appear in $dm_{AA} = dm_{11} + dm_{12} + dm_{21} + dm_{22}$,
but with larger coefficients in the latter. Thus $m_{AB}$ and $m_{BA}$ are eliminated (unless the
starting population had only genotypes 13 and 31). Phenotypes $m_{AA}$ and $m_{BB}$ can coexist,
as both have a non-mixed contribution corresponding to the union of genotypes $m_{11} + f_{11}$
and $m_{33} + f_{33}$.

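The non-zero matrix entries, written as $\alpha_{ijklmn} \equiv \alpha_{ij}^{klmn}$ (with $\alpha_{ij}^{klmn} = \alpha_{ij}^{mnkl}$), are:

$1/2 = \alpha_{111121} = \alpha_{111112} = \alpha_{121112} = \alpha_{121212} = \alpha_{211121} = \alpha_{212121}$

$1/4 = \alpha_{111122} = \alpha_{111212} = \alpha_{111221} = \alpha_{112121} = \alpha_{121122} = \alpha_{121221} = \alpha_{121222}$
$= \alpha_{131212} = \alpha_{211122} = \alpha_{211221} = \alpha_{212122} = \alpha_{221122} = \alpha_{221221} = \alpha_{221222}$
$= \alpha_{222122} = \alpha_{222222} = \alpha_{222332} = \alpha_{232332} = \alpha_{312121} = \alpha_{322332} = \alpha_{332332}$

$1/8 = \alpha_{111222} = \alpha_{112122} = \alpha_{122122} = \alpha_{122222} = \alpha_{131222} = \alpha_{211222} = \alpha_{212222} = \alpha_{231222}$
$= \alpha_{232222} = \alpha_{312122} = \alpha_{322122} = \alpha_{322222}$

$1/16 = \alpha_{112222} = \alpha_{132222} = \alpha_{312222} = \alpha_{332222}$

$1 = \alpha_{111111} = \alpha_{221331} = \alpha_{333333}$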
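
The values listed above can be checked by applying Mendel's rules independently to the trait locus and to the preference locus. The Python sketch below does this with exact fractions. The function names are ours, and the code gives only the Mendelian frequency of a given cross: it does not encode which couples are allowed to form, so couples whose traits and preferences do not match are not set to zero here.

```python
from fractions import Fraction

# Probability that a parent with single-locus genotype g transmits the dominant
# allele (1: dominant homozygote, 2: heterozygote, 3: recessive homozygote).
P_DOMINANT = {1: Fraction(1), 2: Fraction(1, 2), 3: Fraction(0)}


def locus_probability(child, parent1, parent2):
    """Probability of the offspring genotype `child` at one locus."""
    p, q = P_DOMINANT[parent1], P_DOMINANT[parent2]
    if child == 1:
        return p * q
    if child == 3:
        return (1 - p) * (1 - q)
    return p * (1 - q) + (1 - p) * q


def mendel_frequency(ij, kl, mn):
    """Frequency of offspring ij from parents kl and mn.

    Each genotype is a pair (trait locus, preference locus); the two loci
    are assumed to be inherited independently.
    """
    return (locus_probability(ij[0], kl[0], mn[0])
            * locus_probability(ij[1], kl[1], mn[1]))


# The four coefficients of the equation for genotype 33.
coefficients_33 = [
    mendel_frequency((3, 3), (3, 3), (3, 3)),
    mendel_frequency((3, 3), (2, 3), (3, 2)),
    mendel_frequency((3, 3), (3, 2), (2, 3)),
    mendel_frequency((3, 3), (2, 2), (2, 2)),
]
```

The list `coefficients_33` reproduces the factors 1, 1/4, 1/4 and 1/16 that multiply the four terms of the equation for $dm_{33}$.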
FIG. 1: Preference lists are constructed with a step-like structure. In the first positions there is a
fraction $f_\nu(n_+)$ of individuals with non-preferred traits. The parameter $\nu$ controls how efficiently
individuals with a biased arrangement of alleles ($n_+$ away from 1/2) discriminate mates with
preferred traits. The function $f_\nu$ is shown for $\nu$ = 0.1, 0.3 (dots), 1 (diamonds) and 4. Dots and diamonds show
the discrete values of $f_\nu$ for the results of figure 3 (preferences with 30 alleles).

FIG. 2: Number of matings $N_m(t)$ lasting for $t$ iterations. 500 populations were sampled over
10000 iterations, for the model without reproduction events ($t_c = t_r = +\infty$). The distribution
shows power law behaviour, $N_m(t) \sim t^{-\tau}$, with $\tau \approx 2.5$. $\tau$ is independent of the number of
individuals and of the strategies of females. This distribution shows that, due to the correlated
dynamics, individuals are expected to have fitness varying over several orders of magnitude. Thus
correlated dynamics must play an essential role in the evolution of the population.

FIG. 3: Evolution of phenotype frequencies starting from a population with random initial genotypes.
One dominant locus with two alleles controls the trait, and 15 independent loci with two
alleles each control the preference intensity. For $n_+$ close to 1 (0), mates with trait A (B) tend to be
preferred. Sympatric speciation (a) quickly emerges if sufficient discrimination of mating traits exists
($\nu = 1.0$), or (b) is prevented if discrimination is insufficient ($\nu = 0.3$) ($N_{tot} = 200$, $t_c = t_r = 40$).

FIG. 4: Evolution of phenotype frequencies (one dominant locus controls the trait; 15 independent
loci, the preference intensity): (a) starting from a population with random initial genotypes, $\mu$ =
3 x 10 per allele, and weak trait discrimination ($\nu = 0.4$). At generation 200 discrimination is
increased ($\nu = 1.0$). (b) starting from a monomorphic population (all individuals with trait B and
all their preference loci equal to -, i.e. a preference for B traits) and discrimination $\nu = 1.0$. Mutations
($\mu = 6 \times 10^{-4}$ per allele) produce genetic drift such that speciation may emerge when there are
individuals with preferences for both traits. Light squares correspond to phenotypes with more
than 10 individuals.



